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Abstract 

What additional gates are needed for a set of classical universal gates to do universal 
quantum computation? We answer this question by proving that any single-qubit real gate 
suffices, except those that preserve the computational basis. 

The result of Gottesman and Knill Q implies that any quantum circuit involving only 
the Controlled-NOT and Hadamard gates can be efficiently simulated by a classical cir- 
cuit. In contrast, we prove that Controlled-NOT plus any single-qubit real gate that does 
not preserve the computational basis and is not Hadamard (or its alike) are universal for 
quantum computing. 

Previously only a "generic" gate, namely a rotation by an angle incommensurate with 
vr, is known to be sufficient in both problems, if only one single-qubit gate is added. 
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1 Introduction 



A set of quantum gates G (also called a basis) is said to be universal for quantum com- 
putation if any unitary operator can be approximated with arbitrary precision by a circuit 
involving only those gates (called a G-circuit). Since complex numbers do not help in quan- 
tum computation, we also call a set of real gates universal if it approximates arbitrary real 
orthogonal operators. 

Which set of gates are universal for quantum computation? This basic question is im- 
portant both in understanding the power of quantum computing and in the physical imple- 
mentations of quantum computers, and has been studied extensively. Examples of universal 
bases are: (1) Toffoli, Hadamard, and ^-gate, due to Kitaev Q (2) CNOT, Hadamard, and 
^-gate, due to Boykin, Mor, Pulver, Roychowdhury, and Vatan [|[, and (3) CNOT plus the 
set of all single-qubit gate, due to Barenco, Bennett, Cleve, DiVincenzo, Margolus, Shor, 
Sleator, Smolin, and Weinfurter [0. 

Another basic question in understanding quantum computation is. Where does the power 
of quantum computing come from? Motivated by this question, we rephrase the universality 
question as follows: Suppose a set of gates G already contains universal classical gates, and 
thus can do universal classical computation, what additional quantum gate(s) does it need 
to do universal quantum computation? Are there some gates that are more "quantum" 
than some others in bringing more computational power? 

Without loss of generality, we assume that G contains the Toffoli gate, since it is universal 
for classical computation. The above three examples of universal bases provide some answers 
to this question. It is clear that we need at least one additional gate that does not preserve 
the computational basis. Let us call such a gate basis- changing. Our main result is that 
essentially the basis-changing condition is the only condition we need: 

Theorem 1.1. The Toffoli gate and any basis- changing single-qubit real gate are universal 
for quantum computing. 

The beautiful Gottesman-Knill Theorem |p implies that any circuit involving CNOT 
and Hadamard only can be simulated efficiently by a classical circuit. It is natural to ask 
what if Hadamard is replaced by some other gate. We know that if this replacement R is 
a rotation by an irrational (in degrees) angle, then R itself generates a dense subset of all 
rotations, and thus is universal together with CNOT, by Barenco et al. j^]. What if the 
replacement is a rotation of rational angles? We show that Hadamard and its alike are the 
only exceptions for a basis-changing single-qubit real gate, in conjunction with CNOT, to 
be universal. 

Theorem 1.2. Let T be a single-qubit real gate and T^ does not preserve the computational 
basis. Then {C'NOT,T} is universal for quantum computing. 

A basis is said to be complete if it generates a dense subgroup of U (k) modular a phase, 
or 0{k) for some k >2. We actually prove that each of the two bases in the above theorems 
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gives rise to a complete basis. By the fundamental theorem of Kitaev ^ and Solovay 
any complete basis can efficiently approximate any gate (modular a phase), or real gate 
if the basis is real. Therefore, any real gate can be approximated with precision e using 
polylog{j) gates from either basis, and any circuit over any basis can be simulated with 
little blow-up in the size. 

We also provide an alternative prove for theorem LI by directly constructing the ap- 
proximation circuit for an arbitrary real single-qubit gate, instead of using Kitaev-Solovay 
theorem. The drawback of this construction is that the approximation is polynomial in 
however, it is conceptually simpler, and uses some new idea that does not seem to have 
appeared before (for example, in the approximation for Control-sign- flip) . 

There is a broader concept of universality based on computations on encoded qubits, 
that is, fault-tolerant quantum computing. We do not discuss this type of computation, an 
interested reader is referred to the survey of Preskill For a more detailed reference to 
related works, refer to the book of Nielsen and Chuang M . 



2 Preliminary 

Denote the set {1, 2, • • • , n} by [n]. We will mostly follow the notations and definitions from 
the book by Kitaev, Shen, and Vyalyi |^. 

The (pure) state of a quantum system is a unit vector in its state space. The state space 
of one quantum bit, or qubit, is the two dimensional complex Hilbert space, denoted by Ti.. 
A prechosen orthonormal basis of Ti. is called the computational basis and is denoted by 
{|0), |1)}. The state space of a set of n qubits is the tensor product of the state space of 
each qubit, and the computational basis is denoted by 

{\b) = \bi)(g) I62) • • • » \bn) ■■ b = 6162 • • • 6„ G {0, 1}"} . 

A gate is a unitary operator U G \J{7i'^'^), for some integer r > 0. For an ordered subset A 
of a set of n qubits, we write U [A] to denote applying U to the state space of those qubits. 
A set of gates is also called a basis. A quantum circuit over a basis G, or a G-circuit, on n 
qubits and of size m is a sequence Ui[^i], U2[^2]; • • • , Um[^m]! where each Uj G G and 
Ai C [n]. Sometimes we use the same notation for a circuit and for the unitary operator 
that it defines. The following definition generalizes the definition in Q. 

Definition 2.1. The operator U : Ti^^ -j-^^^r jg approximated by the operator U : "H®^ — > 
T-C^^ using the ancilla state |^) G T^t^^A^-r fo^, arbitrary vector |0 G 



u(|0 ® 1^)) - u|0 ^ 1^) 



< e 



Let G be a basis. A G-ancilla state, or an ancilla state when G is understood, oii qubits 
is a state A|5), for some G-circuit A and some b G {0, 1}^. A basis G is said to be universal 
for quantum computing if any gate (modular a phase), or any real gate when each gate in G 
is real, can be approximated with arbitrary precisions by G-circuits using G-ancillae. By a 
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phase, we mean the set {exp(za) : a G R}. The basis is said to be complete if it generates a 
dense subgroup of U{k) modular a phase, or 0{k) when its real for some k>2. A. complete 
basis is clearly universal. 

Now we introduce the standard notations for some gates we shall use later. Denote 
the identity operator on by I. We often identify a unitary operator by its action on the 
computational basis. The Pauli operators and cj^, and the Hadamard gate H are 

'^^^(»-")' "-^(:-0- 

If U is a gate on r qubits, for some r > (when r = 0, U is a phase factor), A'^(U) is the 
gate on fc + r qubits that applies U to the last r qubits if and only if the first k qubits are 
in The superscript k is omitted ii k = 1. Changing the control condition to be |0)®'^, 

we obtain A''(U). The Toffoli gate is A2(c7^), and CNOT is A(cr^). Evidently the latter can 
be realized by the former. From now on we only consider real gates. As in the previous 
section, a gate g is said to be 6asis-changing if it does not preserve the computational basis. 



3 Completeness proofs 



In this section we prove the following theorems, from which Theorem 1.2 and Theorem 1.1 
follow immediately. 

Theorem 3.1. Let S be any single-qubit real gate that is basis- changing after squaring. 
Then {CNOT, S} is complete. 



Theorem 3.2. The set {A^((T^),i/} is complete. 

We need the following two lemmas, which fortunately have been proved. 

Lemma 3.3 (Wlodarski Q). // a is not an integer multiple ofir/A, and cos /? = cos^ a, 
then either a or (3 is an irrational multiple o/tt. 



Lemma 3.4 (Kitaev [^). Let M. be a Hilbert space of dimension > 3, |.^) G a unit 
vector, and H C SO{Ai) be the stabilizer of the subspace M(|.^)). If \/ £ 0{M.) does not 
preserve M(|^)), Hljy^^HW generates a dense subgroup of SO{Ai) . 



Proof of Theorem 3.1. Define 

U := (S«)S- A((T^)[1,2])2. 

It suffices to prove that U and A((T^) generate a dense subgroup of 50(4). Without loss 
of generality, we assume that U is a rotation by an angle 9, the other case can be proved 
similarly. Then by the assumption, 9 is not an integer multiple of 7r/4. 



4 



Direct calculation shows that U has eigenvalues {1, 1, exp(ibza)}, where 

a = 2 arccos cos^ 6. 
The two eigenvectors with eigenvalue 1 are 

16) :=^(|00)-|01) + |10) + |11)), 

and 

16) :=5^(-|0) + |l)) + ^2^(|0)-|l». 

Let ll^j) : i G [4]} be a set of orthonormal vectors. 

By Lemma |3.3| , a is incommensurate with vr, therefore, U generates a dense subgroup 
of Hi := ^©(spanll^s), 1^4)}). Note that A((t^')[1,2] preserve but not span{|^2)}- 
Therefore, by Lemma the set 

lfi|jA(a-)[l,2] /fi A(a-)[1,2] 

generates a dense subgroup of 

50(span{|6) :i = 2,3,4}) =: H2, 

thus so does {U, A(cr^')[l, 2]}. Finally, observe that A(cr^)[2, 1] does not preserve span{|^i)}, 
therefore, apply Lemma again we conclude that {U, A(c7^)[l, 2], A(c7^)[2, 1]} generates a 
dense subgroup of 50(4). 



Proof of Theorem 3.i. Define 



U := A2(fj^) [1,2,3]) ^ 



Direct calculation shows that U has eigenvalue 1 with multiplicity 6, and the other two 

3 
4- 



eigenvalues \± := exp(ibia), where a = vr — arccos |. Since \± are roots of the irreducible 
polynomial 



which is not integral, therefore \± are not algebraic integers. Thus a is incommensurate with 
vr, which implies that U generates a dense subgroup of the rotations over the corresponding 
eigenspace (denote the eigenvectors by 1^7) and l^s))- 

By direct calculation, the eigenvectors correspond to eigenvalue 1 are: 

{|000), |010), |100), |001) + |011), |101) + |110) + |111), |011) - |101)} . 
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Label the above eigenvectors by i S [6]. It is easy to verify that each Uj, i S [6], 
constructed below preserves : 1 < J < but not span{|,^j)}. 

Ui := I I U2 := Ui • A2(a^')[2, 3, 1] • Ui, 

U3 := Ui • A2(a")[l, 3, 2] • Ui, U4 := A'K)[2, 3, 1], 

U5 := Ui • A2K)[2,3,1] • Ui, Ue := A2(c7-')[1, 3, 2]. 

Applying Lemma |3^ several times, we see that {U, Uj, Uj+i, • • • , Ug} generates a dense 
subgroup of span{|^j) : i < j < 8}. Thus {A^(cj^),i/| generates a dense subgroup of 
50(8). We leave the details for the interested readers. 

4 Alternative proof for Theorem 

Fix an arbitrary basis-changing real single-qubit gate S, and the basis 

^■.= {S,A\a-)}. 

In this section we give an explicit construction to approximate an arbitrary real gate using 
the basis Due to the following result by Barenco et al. 0, we need only consider 
approximating single-qubit real gates: 

Proposition 4.1 (Barenco et al. ||l|). Any gate on r qubits can be realized by 0(r^4'') 
CNOT and single-qubit gates. 

Fix an arbitrary single-qubit gate W that we would like to approximate. Without loss 
of generality, we can assume that S and W are rotations, for otherwise a^S and cr^W are. 
For any j3 G [0, 27r), define 

:= cos/3|0) -|- sin/3|l), and, U/3 := 

Let 6,a £ [0, 2tt), and 6 not an integral multiple of vr/2, be such that S = and W = Uq,. 
The following proposition can be easily checked. 

Proposition 4.2. Let be a gate on k + 1 qubits such that Wq,/2|0)'^^'''"^ = \4>a/2) ^ 

\0f\ With 

W„:=W„/2(-A'=+^(-l))Wt/2^^[l]' (1) 

for any vector |0 G 7i, 

Ua|O®|0)^' = W„(|e)®|0)®'). (2) 

Clearly A'=+i(-l) can be realized by A'^^tj*^) and c^. Therefore, to approximate Uq,, 
it suffices to approximate and which we will show in the following subsections. 

Define the constants 

6e:=l/\og J' and, 5'q := l/\og—^. 

cos^ 9 + sm* 9 cos^ 9 



cos (5 
sin/3 
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4.1 Approximating cr^ 



If is a multiple of 7r/4, say 6 = '7r/4, then we can easily do a sign- flip by applying a 
bit-flip on Ue|l) = 3^|0) + But for a general 9, Ue|l) = -sin6'|0) + cos6l|l) is 

"biased" . Immediately comes into mind is the well-known idea of von Neumann on how to 
approximate a fair coin by tossing a sequence of coins of identical bia^. To illustrate the 
idea, consider 

U^IO) O Ue|l) = sin6lcos6l(|ll) - |00)) + cos^ 6l|01) - sin^ 6l|10). 

If we switch |00) and |11) and leave the other two base vectors unchanged, the first term 
on the right-hand side changes the sign, while the remaining two terms are unchanged. 
While we continue tossing pairs of "quantum coins" and do the |00)-and-|ll) switch, we 
approximate the sign-flip very quickly. The state defined below will serve the role of -^ll) — 

Definition 4.3. For any integer A; > 0, the phase ancilla of size k is the state 

\^k) :=(U,|0)®Ue|l)^^ 

Clearly 1$^) can be prepared from lo)*^^*"' by a *B-circuit of size 0{k). 

Lemma 4.4. The operator cr^ can he approximated with precision e, for any e > 0, hy a 
'^-circuit of size 0{k), using the phase ancilla |<5fc), for some integer k = O [6g log ^) . 

Proof. Let k be an integer to be determined later. The following algorithm is a description 

of a circuit approximating using \^k)- 

Algoritiim 1. A IB-circuit approximating using the phase ancilla \^k)- 
Let |6o) |6) be a computational base vector, where bo £ {0, 1} is the qubit to which 
is to applied, and b = 6i6'j^62^2 ' ' ' ^fc^fc ^ i*-*' ^^'^'^ ancilla qubits. Condition 

on bo (that is, if 6o = 0, do nothing, otherwise do the following). 

Case 1: There is no i such that bi (B b[ = 0, do nothing. 

Case 2: Let i be the smallest index such that bi®b^ = 0, flip bi and 6^. 

Clearly the above algorithm can be carried out by 0{k) applications of Toffoli. Fix an 
arbitrary unit vector |^) G 7i. Since neither cr^ nor changes |0)(0|(|^) |*5fc)), 

h'\e)®\^k)-~^'m®\^k))\\ < \\-\i)®\^k)-d\\i)(^\^k))\\. (3) 

Let \'^^) (I'J'fc )) be the projection of to the subspace spanned by the base vectors 
satisfying Case (1) (Case (2)), it is easy to prove by induction that 

a\\l)(E)\^t)) = \l)®\^ t)^ and, ~a\\l) ® = ® . 

^That is, toss two coins, declare "0" if the outcomes are "01", declare "1" if the outcomes are "10", and 
continue the process otherwise. 
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Furthermore, 



Therefore, the left-hand side of Equation ^ is upper bounded by 

2|||$+)|| =2(cos^0 + sin4e)'^/'. 

Since 9 is not a multiple of 7r/2, the right-hand side is < 1. Thus choosing k = 0{5g log i), 
the right-hand side in the above can be made < e. □ 

4.2 Creating |0q/2) 

We would like to construct a circuit that maps \Q) to a state close to \4>a/2) ®\^)^^ ■ 

The main idea is to create a "logical" 1 00-/2 )■ 



60/2) := cos-|0) -Fsm-|l), 



(4) 



2 ' ' 2 

where |o) and |l) are two orthonormal vectors in a larger space spanned by ancillae, and 
then undo the encoding to come back to the computational basis. To create 4'a/2^y we first 
create a state almost orthogonal to |o), and then apply Grover's algorithm Q to rotate this 
state toward (f>a/2^- Define the operator Tg on 2 qubits as 

T, :=U.e[l] AK)[1,2] U,[l]. (5) 
Since for any /3, U_/3 = a^U^a^, Tg and A.{Tg) can be realized by the basis !B. Let 

!Bi := {A2(a^),cT^U,,U_e,T,,A(Te)}. 

Lemma 4.5. For any e > there exists a ^i-circuit Wq,/2 of size 0{6'g^log^) that uses 
0{6glog^) ancillae and satisfies 



Wo/2|0)^'=+l-|<Aa/2)®|0) 



< e. 



Proof. Figure illustrates our proof. Let A; > be an integer to be specified later. Define 



|0)' 



t,2k 



Tf 16), 



and, 



7 := arcsm cos 



2k . 



Notice that 7r/2 — 7 is the angle between |6) and |i), and < 7 < vr/2, since sin7 = l). 
Let S be the plane spanned by |o) and |i) Let |l) be the unit vector perpendicular to |o) 
in S and the angle between |l) and |l) is 7. 

Observe that on S we can do the reflection along |l) and the reflection along The 
former is simply A^'^(ct^), which can be implemented using A^((T^) and . Since T^ ^ = Tg, 
the reflection along |i) is 
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Without loss of generality we can assume a/2 < 7r/2; otherwise we will rotate |i^ close to 
A^^((T^) ^Q,/2^ and then apply h?^{u^). Choose k sufficiently large so that 7<7r/2 — a/2. 

Now we can apply Grover's algorithm to rotate |i) to a state very close to 4'a/2^- After 
that we do a "controUed-roU-back" to map |i) (approximately) to and does not change 
|0). This will give us an approximation of [(/>„/2) in the state space of the controlling qubit. 
The algorithm is as follows. Let T be the integer such that |7r/2 — (2T + 1)7 — a/2| < 7. 
Then r = 0(1/7). 

Algorithm 2. A *Bi-circuit \Na/2 that maps |0) (g) |0)®^'^ to a state close to \(Pa/2 



|0)^ 



1. Apply I Tf ^. 

2. (Grover's algorithm) Apply (R A^'^(c7^))' 

3. (Sub-circuit A3) For a computational base vector \b) of the ancillae, if |6) 7^ |0), 
flip the first bit. 

4. (Sub-circuit A4) Use the first bit as the condition bit, apply A(T|''^). 



It can be easily verified that 



\f^a/2i\0)^\0fn-\^a/2)^\0) 



i2k 



< 27. 



Setting 7 f» e/2, by direct computation the number of ancillae is 0{k) = 0(5^ log ^), and 
the size of W^/2 is 0(^/7) = 0((5^i log i). 

□ 
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4.3 Approximating Uc 

Theorem is a straightforward corollary of the following theorem and Proposition |4.1| . 

Theorem 4.6. For any e > 0, the operator Uq, can he approximated with precision e by a 
'^-circuit of size O (Jg • | • log \) and using O [5g ■ log ^) ancillae. 

Proof. We first compose a *Bi-circuit that approximates Uq,, according to Equation [l|, 
and Algorithm and use ki (different) ancillae in each call to the latter, for an integer ki 
to be specified later. Let 7 := cos^^^^ 9. Then the precision is 0(7). After implementing Tg 
and A{Tg), there are in total O(^) uses of a^. 

Finally we apply Algorithm |l] to approximate each using the same phase ancilla 
l^fcj) for k2 = 0(1/7^). Let 5g := 2(cos'^0 + sin^0)'^2/2 error of one call to using 

exactly |^'fc2)- Observe that using the same phase ancilla for O(^) times causes error at most 

1 + 2H \-0{^)-l = 0{^) Setting 6g = j^, the total error caused by is 0(7). Thus the 

total error of the whole circuit is still 0(7). Setting 7 f« e, A;i = 0{6glog^) = 0((5elogi) 
and ^2 = 0{6glog^). Therefore the number of ancillae is 0{ki + ^2) = 0(50 log i). The 
size of the circuit is 0{{ki + k2)^) = 0{6g^ log 

□ 
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